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ABSTRACT 


A three  state  optimal  control  law  had  been  Implemented 
in  the  missile  simulation  under  the  last  contract.  Before  a 
Kalman  filter  could  be  designed  and  implemented  on  the  6D0F 
simulation  to  reduce  the  effect  on  performance  of  random  distur- 
bances, a serious  problem  of  performance  sensitivity  had  to  be 
overcome.  To  this  end,  it  was  decided  that  the  control  problem 
had  to  be  reformulated  with  hard  constraints  on  the  controller. 

The  previous  control  problem  only  assumed  a 'soft  constraint' 
in  the  functional  J, 

tf  2 

J = {other  terms  at  t^}  + u (t)dt  , 

to  be  minimized. 

To  solve  this  control  problem,  a review  of  the  pertinent 
control  literature  was  conducted  which  is  given  in  Chapter  III. 

Our  control  problem  is  formulated  in  Chapter  TV  as  a minimum  time, 
minimum  fuel,  and  minimum  energy  problem.  The  solution  as  a 
minimum  time  problem  is  discussed.  The  computer  implementation 
in  a simple  four  state  simulation  is  given  in  Chapter  V with 
conclusions  and  recommendations  for  future  study  given  in  Chapter  VI. 
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Chapter  I 


INTRODUCTION 


Our  contractual  task  was  to  investigate  and  reduce  the 
effect  of  random  disturbances  on  missile  performance.  A guidance 
filter  was  to  be  designed  to  produce  smoothed  estimates  of  various 
states  which  in  turn  are  used  to  control  the  missile.  Primarily, 
the  line-of-sight  angle  X and  angular  rate  X along  with  the  attitude 
angle  0 and  angular  rate  output  0 are  needed  in  both  pitch  and  yaw 
channels.  Random  disturbances  which  are  inherent  in  measuring  the 
states  are  to  be  filtered  out  to  produce  the  desired  smoothed 
estimate . 


Although  the  three-state  control  law  had  been  implemented 
and  did  produce  an  acceptable  miss  distance,  the  implementation  was 
of  little  practical  value.  The  states  used  (Y^  and  Y.)  must  be 
replaced  with  others  (X  and  X).  Even  if  the  implementation  were 
altered  to  use  these  states  along  the  lines  suggested  in  the  contract 
report  [ 6 1 » a more  serious  problem  needed  to  be  overcome.  Small 
changes  in  initial  conditions  or  in  control  parameter  values 
produced  unacceptable  changes  in  miss  distance  and  attitude  angle  at 
impact.  The  control  law  was  extremely  sensitive  to  such  changes 
and  behaved  in  a highly  discontinuous  manner. 

Before  work  could  begin  on  the  design  of  the  Kalmer  filter, 
it  was  essential  to  resolve  the  sensitivity  problem.  To  this  end, 
two  methods  seemed  to  offer  some  promise.  The  first  was  to  try  to 
fly  the  missile  with  a four-state  controller  which  was  based  on 
previous  work  done  by  Pastrick  and  York  [9].  Basically,  the  four 
state  controller  assumes  a first  order  lag  model  for  the  autopilot; 
whereas  the  three-state  controller  assumes  instantaneous  autopilot 
response.  Chapter  II  discusses  this  work. 

The  second  method  used  to  overcome  the  sensitivity  problem 
was  to  investigate  reformulating  the  control  problem  with  hard 
constraints  on  the  controller  of  the  form: 

| u(t) | < M . (1.1) 

The  previous  work  only  assumed  a soft  constraint  with  a weighted 
integral  of  the  square  of  the  control  u in  the  functional  J to  be 
minimized,  i.e. 

tf  2 

J = {other  terms}  + b/q  u (t)  dt 

To  accomplish  this  goal  of  reformulating  the  control 
problem  as  one  with  hard  constraints,  a review  of  the  control 
literature  was  undertaken.  Chapter  III  describes  the  major  types 
of  control  problems:  minimum  time,  minimum  fuel,  and  minimum  energv. 

A simple  two-state  system  is  used  to  compare  and  contrast  the  three 
different  types  of  control  laws. 
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2.1  Performance  Sensitivity 


The  three-state  control  law  had  been  implemented  in 
the  6D0F  missile  simulation  under  the  assumption  of  perfect 
knowledge  of  the  states- -Y,,  Y , and  6 (see  Figure  4.1  and 
Table  4.1  for  a detailed  description  of  these  states).  In 
the  actual  system,  however,  the  line-of -sight  angle  X and 
angular  rate  are  available,  not  Y^  and  Y^.  This  meant  that 
the  controller  had  to  be  reformulated  in  these  variables  along 
with  0.  One  method  for  doing  this  was  outlined  in  the  contract 
final  report. 

A far  more  serious  problem  existed  with  the  control 
implementation  than  the  above,  and  that  was  a problem  of  per- 
formance sensitivity.  Small  changes  in  initial  conditions  or 
in  the  control  parameters  would  produce  unacceptable  changes 
in  miss  distance,  as  shown  in  Table  2.1. 

Table  2.1.  6D0F  Simulation  Runs  for  Various  Initial  Conditions 


Y.(t  ) 
a o 

w 

2 

W 

RXE 

RZE 

e(tf) 

-5000. 

-1500. 

-5. 

1090.856 

.3 

-10.4 

-38.1 

-4500. 

-1500. 

-5. 

1090.856 

1.6 

-68.9 

-44. 

-5500. 

-1500. 

-5. 

1090.856 

-1065.3 

1.2 

-77.5 

-5000. 

-1400. 

-5. 

1090.856 

-690.7 

.5 

-47.2 

-5000. 

-1600. 

-5. 

1090.856 

-116.7 

1.5 

-89.7 

-5000. 

-1500. 

-4. 

1090.856 

.7 

-150.4 

-32.4 

-5000. 

-1500. 

-6. 

1090.856 

-495.1 

.9 

-41.8 

-5000. 

-1500. 

-5. 

1085.856 

.9 

-127.1 

-35.5 

-5000. 

-1500. 

-5. 

1095.856 

-448.7 

1.2 

-43.4 

RXE  and  RZE  are  the  coordinates  of  the  point  on  the  trajectory 
where  the  missile  is  closest  to  the  target. 


2.2  Four-State  Controller 


To  overcome  the  sensitivity  problem,  it  was  decided 
to  investigate  using  a four-state  controller  previously  developed 
by  Pastrick  and  York  [9].  The  pertinent  details  of  this  work 
appear  as  Appendix  A in  the  contract  report  [16].  .Basically, 
our  mathematical  model  consists  of  four  states  Y , Y^,  0,  and 
to  describe  the  missile  - target  geometry  in  a plane.  The 
differential  equations  describing  the  dynamics  are 

V*d 

Yd  = -A^osQ 
\ = -wlAl  + Klu 

0 = K u . 
a 

The  contract  work  made  the  assumption  that  the  lag 
in  the  autopilot  could  be  ignored  and  employed  only  a 'soft 
constraint'  on  the  controller  u in  the  form  of  an  integral  of 
u^  in  the  cost  functional  to  be  minimized.  By  ignoring  the 
autopilot  lag,  the  system  could  then  be  described  by  just  three- 
state  variables.  It  is  difficult  to  say  with  much  precision 
which  assumption  causes  the  sensitively  problem,  but  it  is 
probably  the  latter. 

In  the  hope,  however,  of  reducing  performance 
sensitivity,  it  was  decided  to  replace  the  three-state' controller 


(2.1) 

(2.2) 

(2.3) 

(2.4) 


u - cl(t)Vd 


C2(t)Yd  + C3(t)0 


(2.5) 


with  the  four-state  controller 


u = d1(t)Yd  + d2(t)Yd  + d3(t)0  + d4(t)AL  . (2.6) 

This  four-state  controller  is  considerably  more  complex  than 
the  three-state  one,  but  it  gives  better  performance  over  a 
wider  range  of  autopilot  lag  values.  See  [9],  [16]  for  compari- 
sons in  performance  using  the  two  controllers. 


2.3  Computer  Simulation  Results 

The  four  state  simulation  with  the  four  state  controller 
has  not  been  made  to  work.  At  first,  there  was  a coding  error, 
but  this  has  since  been  found  and  corrected.  The  main  problem 
has  been  a lack  of  access  to  the  computer  facility  at  the  University 
of  Kentucky.  During  the  first  part  of  this  contract,  the  telephone 


4 


v 


lines  used  for  transmission  were  out  of  order.  With  the  telephone 
maintenance  men  on  strike,  this  situation  was  not  remedied  for 
over  a month.  By  that  time,  the  decision  had  already  been  made 
to  pursue  the  reformulation  of  the  control  problem.  A successful 
four  state  implementation  would  have  been  preferred,  but  the 
author  does  feel  that  the  final  result  would  have  been  negative, 
that  is,  the  sensitivity  problem  would  be  present  with  the  four 
state  controller  as  was  with  the  three  state  one.  The  extreme 
sensitivity  shown  in  Figure  2,1  makes  this  conclusion  almost  a 
certainty. 
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Chapter  III 


OPTIMAL  CONTROL  THEORY  REVIEW  FOR 
BOUNDED  CONTROLLERS 

3.1  The  Unconstrained  Control  Problem 

The  unconstrained  optimal  control  problem  is  to  find  an 
admissible  control  u*  that  causes  the  sys  'em 

x ( t ) = a(x(t) ,u(t),t)  ( 


(3.1) 


to  follow  an  admissible  trajectory  x that  minimizes  the  performance 


measure 


J(u)  = h(x(tf),tf)  + /tfg(x(t) ,u(t) ,t)dt, 


(3.2) 


where  x is  the  n x 1 state  vector  and  u is  the  m x 1 vector  of  control 
inputs.  The  problem  assumes  no  bounds  on  either  the  state  or  control 
regions. 

The  solution  to  the  problem  is  given  by  first  defining  the 
Hamiltonian  (H) , 

_T 

H(x(t) ,u(t) ,p(t) ,t)  = g(x(t) ,u(t) ,t)  +p  (t) [a(x(t) ,u(t) ,t>  J.(3. 3) 

The  n x 1 vector  p(t)  is  the  vector  consisting  of  the  set  of  Lagrange 
multipliers^  The  following  system  of  differential  equations  are  then 
solved  for  x,  p,  and  u: 


* 

X ( t ) = ^=(x(t)  ,u(t)  ,p(t)  ,t) 

• 3U  

p(t)  = - -^(x(t)  ,u(t)  ,p(t)  ,t) 
0 = -^=(x(t)  ,u(t)  ,p(t)  ,t) 
subject  to  the  two  point  boundary  conditions 

x(to)  = xq  , x(tf)  = xf  (if  given) 


(3.4) 


(3.5) 


(3.6) 


(3.7) 


'ah,-,  \ 

[ 3^(x(tf) ’ 


tf)  - p(tf)  6xf  + 


H(x(tf),u(tf),p(tf),tf)  + |^(x(tf ) ,tf ) 


(3.8) 


6t  = 0. 


Equations  (3.4),  (3.5),  and  (3.8)  are  referred  to  as  the  state  equations, 
costate  equations,  and  the  transversality  condition,  respectively. 
Equations  (3.4),  (3.5),  and  (3.6)  are  necessary  (not  sufficient) 
conditions  for  an  optimal  controller. 


V/ hen  constraints  are  placed  on  the  controller,  then  the 
controller  u*  where  Equation  (3.6)  is  satisfied  may  very  well  lie 
outside  the  admissible  region;  in  which  case,  the  partial  derivative 
is  undefined.  Another  approach  is  needed,  and  this  is  provided  by 
Pontryagin's  minimum  principle. 


3.2  The  Constrained  Optimal  Control  Problem 

When  a constraint  on  the  control  region  is  imposed,  for 

example , 

”i  - I ui C c ) | £ 6i  » i = l,...m  (3.9) 

then  Pontryagin's  minimum  principle  must  be  used  to  solve  the  control 
problem. 

Pontryagin's  minimum  principle : A necessary  condition  for 
u*  to  minimize  the  functional  J is  that 

jlf  ^ ^ 

H(x  (t),u  (t),p  (t),t)  _<  H(x  (t),u(t),p  (t),t),  (3.10) 

for  all  t e [t0,tf]  and  for  all  admissible  controls  u.  In  essence, 
Pontryagin's  minimum  principle  states  that  an  optimal  control  must 
minimize  the  Hamiltonian  H globally,  not  locally. 

The  solution  to  the  constrained  control  problem  then  is 
to  solve  Equations  (3.4),  (3.5),  and  (3.10)  instead  of  (3.6).  The 
boundary  conditions  remain  unchanged. 


3.3  Three  Major  Types  of  Control  Problems 

For  the  solution  of  our  missile  control  problem,  we  will 
be  interested  primarily  in  three  variations  of  the  functional  J: 
the  minimum  time,  minimum  fuel,  and  minimum  energy  problems.  The 
solutions  of  these  three  problems  will  be  stated  for  a linear, 
time-varying  system.  The  example  used  to  illustrate  these  three 
types,  as  well  as  the  missile  control  problem  in  which  we  are 
interested,  is  time-invariant  however. 

A linear,  time-varying  state  system  is  represented  by 

x ( t ) = a(x(t) , t)  + B(x(t) ,t)u(t)  (3.11) 

where  x is  n x 1,  a is  n x 1,  u is  m x 1,  and  B is  n x m.  We 
assume  that  the  control  restraints  are  normalized,  that  is, 

|u.(t) | < 1 , i = l,...m  . (3.12) 

Throughout,  the  object  will  be  to  take  the  system  to  the  origin, 
whether  the  final  time  t^  is  specified  or  left  free. 


( 


We  define  the  three  types  of  problems. 


Definition  3.1.  If  the  functional  J is  given  by 

tf 

J = J0  1 dt  , (3.13) 

then  the  control  problem  is  referred  to  as  the  minimum 
time  problem. 

Definition  3.2.  If  the  functional  J is  given  by 


J ■ to  iSAKw 


dt  , 


then  the  control  problem  is  referred  to  as  the  minimum  fuel 
problem. 

Definition  3.3.  If  the  functional  J is  given  by 

t. 


J = / r 


n 2 

.1  a .u. (t) 
i=l  l l 


d t 


(3.14) 


(3.15) 


then  the  control  problem  is  referred  to  as  the  minimum  energy 
problem . 


The  next  three  sections  state  the  solution  to  each  type 
of  control  problem  and  use  the  two-state  system, 


xl  (O  = *2(t) 

(3.16) 

x2(t)  = u ( t ) 

(3.17) 

| i 1 , tf  free  , 

(3.18) 

as  an  example.  Note  that  u has  only  one  component,  which  will 
also  be  the  case  in  our  missile  problem. 


3.3.1  The  Minimhm  Time  Problem 


The  form  of  the  optimal  controller  u*  is  given  for 
i = 1 , 2 , . . .m  by 


1 

for 

p1 (t)b. (x(t)  ,t)  < 

0 

u*A(t)  =< 

-1 

for 

pT(t)bi (x(t) ,t)  > 

0 

(3.19) 

undetermined 

for 

pT(t)b1 (x(t)  ,t)  = 

0 

where  b^(x(t),t)  = i column  of  the  matrix  B.  Note  that  the 
minimum  time  controller  is  of  the  'bang-bang'  variety  (maximum 
effort  throughout  the  interval)  . 


When  the  system 

x(t)  = Ax(t)  + Bu(t) , (3.20) 

is  time  invariant  and  controllable,  several  theorems  apply,  such  as: 

Theorem  3.1.  (Existence  and  Uniquences).  If  all  of  the  eigenvalues 
of  A have  non-positive  real  parts,  then  a unique  optimal  controller 
exists  that  transfers  any  initial  state  xQ  to  the  origin. 

Theorem  3.2.  (Number  of  Switchings).  If  the  eigenvalues  of  A are 
all  real,  and  a (unique)  time-optimal  control  exists,  then  each 
control  component  can  switch  at  most  n - 1 times. 

In  our  example,  since  the  eigenvalues  of  A are  both  zero, 
then  a unique  optimal  controller  exists,  and  it  can  change  sign  at 
most  once. 


For  our  example,  the  Hamiltonian  H is  given  bv 


H = 1 + p. 

L(t)x2 

(t)  + 

P2(t)u(t) 

(3. 

21) 

and  the  optimal 

controller  u by 

-1 

for 

P9(t)  > 0 

u*(t)  = -< 

z 

(3. 

,22) 

[+1 

for 

P2(t)  < o. 

The  solution  to 

the  costate  equation 

is 

P 

L(t)  = 

*i 

(3. 

,23) 

P2(0  = 

-v 

+ 7T  2 • 

(3. 

,24) 

As  can  be  seen  from  Equation  (3. 24),  the  control  u can  change 
sign  at  most  once.  The  possible  optimal  control  sequences  are 
{+1},  {-1},  {+1,-1},  {-1,+1}. 

To  find  the  switching  curve  in  the  state  space,  we  solve 
the  state  equations  with 

u = A = + 1 

to  obtain 


x2(t)  = At  + §2 

xt(t)  = ,|t2  + §2t  + §1  . 


(3.25) 

(3.26) 
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Eliminating  the  parameter  t,  we  obtain 

x (t)  = ^*2 ( t ) + t , (3 

where  x = t(A,§^,§2)  is  a constant.  The  trajectories  are  given  in 
Figure  3.1  with  the  solid  line  corresponding  to  the  (u  = +1) 
command,  and  the  dashed  line  corresponding  to  the  (u  = -1) 
command. 


Figure  3.1.  Trajectories  for  u = ± 1 


The  switching  curve  AOB  is  composed  of  the  two  portions  of  the 
parabolas  passing  through  the  origin;  its  equation  is  given  bv 


xl(t)  = -sgn(x2(t))x2(t)/2 


s ( x ( t ) ) = x1(t)  + sgn(x2(t))x2(t)/2  = 0 
and  is  depicted  in  Figure  3.2. 


(3.28) 


(3.2Q) 


Figure  3.2.  The  Switching  Curve 


To  summarize,  the  solution  to  the  control  problem  is  to  apply 
a (u  = -1)  control  for  those  initial  states  which  lie  above  (s(x(t))  > 0) 
the  switching  curve.  The  resulting  trajectory  will  then  intersect  the 
switching  curve  AOB  at  which  time  the  control  switches  to  +1.  In  a 
similar  fashion,  for  those  points  below  (s(x(t))  < 0)  the  switching 
curve,  the  reversed  sequence  will  bring  the  system  to  the  origin 
(see  Figure  3.3). 

x?(t) 


Figure  3.3  Optimal  Trajectories  for  Various  Initial  States 

I 

3.3.2  The  Minimum  Fuel  Problem 

The  form  of  the  optimal  controller  is  given  for  i = l,...m  by 

1.0  for  p^ ( t)b . (x( t) , t)  < -1.0 

0.0  for  -1.0  < pT(t)bj (x( t) ,t)  < 1.0 

— T __  _ 

-1.0  for  1.0  < p (t)b  (x(t)  ,t)  (3.30) 

v if  pT(t)b . (x(t) ,t)  = -1.0 

-w  if  pT(t)bi (x(t)  ,t)  = 1.0 

where  v and  w are  undetermined  constants  in  the  interval  [0,1]. 

Note  that  the  minimum  fuel  controller  is  of  the  bang-off-bang  variety. 

For  some  initial  state  values  in  our  two-state  example 
Equations  (3.16)  and  (3.17)),  an  optimal  minimal  fuel  controller  does 
not  exist.  Let  us  examine  the  situation  more  closely.  Our  controller 
is  given  by 


if  we  ignore  the  possibility  of  singular  controls  for  the  moment. 
The  Hamiltonian  H is  given  by 


H = |u(t) | + p1(t)x2(t)  + p2(t)u(t)  . (3.32) 

Note  that  the  state  and  costate  equations  remain  unchanged.  The 
possible  control  sequences  are  {+1},  {-1},  {0},  {+1,0},  {-1,0}, 

{0,+l},  {0,-1},  {l,0,-ll,  {-1,0, +1}.  The  locus  of  points  that  can 
be  driven  to  the  origin  with  control  (u  = +1)  or  (u  = -1)  is  the 
same  as  before  (see  Figure  3.2). 

With  control  (u  = 0) , the  state  trajectories  are 


*2(t)  = §2 

xL(t)  = §2t  + §x 
and  are  sketched  in  Figure  3.4. 


(3.33) 


Figure  3.4.  State  Trajectories  for  u = 0 


Note  that  if  §2  = 0,  then  x^  does  not  change  until  u becomes  non-zero. 
Hence,  unless  the  initial  state  is  the  origin,  the  control  (u  = 0) 
cannot  bring  the  system  to  the  origin;  consequently,  any  control 
sequence  candidate  ending  in  'O'  should  be  dismissed  from  consideration. 

We  can  now  give  some  idea  of  how  to  steer  any  initial  state 
to  the  origin.  If  the  initial  state  ( § 1 0 ) is  in  the  set 

R1  S {(x2»x2)l  (x2>x2)  lies  above  the  switching 

curve  (s(x(t) ) = 0)  and  x2  > 0}  , (3.34) 

then  apply  the  control  u = -1  until  x2  becomes  negative.  Next, 
allow  the  system  to  coast  (u  = 0),  until  the  switching  curve  is  inter- 
sected at  which  time,  the  control  u = +1  will  drive  the  system  to  the 
origin  (see  Figure  3.5). 
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B u=-l 


x?(t ) 


x^(t ) 


Figure  3.5.  Trajectories  for  Initial  Values  in  and  R2 


As  the  drawing  would  indicate,  an  optimal  controller  does  not 
exist  since  if  u is  any  {-1,0, +1}  control  sequence  which  drives  the 
system  to  the  origin,  a more  optimal  one  can  be  obtained  by  switching 
to  'O'  earlier.  The  switching  cannot  be  made  at  x equal  to  zero, 
but  must  wait  until  X2  becomes  negative. 

If  the  initial  state  is  in  R^  defined  by 

r2  e {(xj_,x2)|  (xj,x2)  lies  above  the  switching  curve 

and  x2  < 0},  U ' 

then  the  control  sequence  {0,+l}  will  bring  the  system  to  the 
origin.  The  controller  is  optimal,  but,  interestingly,  it  is  not 
unique  (see  Figure  3.5).  The  reason  for  this  is  that  the  control 
problem  is  singular  in  this  region.  The  control 


u(t)  = v , 0 < v < 1 


(3.36) 


can  be  applied  throughout  the  interval  of  operation.  The 
transversality  condition.  Equation  (3.8),  is  satisfied  if 

H(tf)  = j v | + 0 + p2(tf)v  = 0. 


(3.37) 


Equations  (3.37),  (3.30),  (3.23),  (3.24)  imply  that 


Pj^(t)  = 0 and  p2(t)  = -1  , t e 1 0 , t f ] , 


(3.38) 


The  resulting  trajectory  is  parabolic,  running  through  the 
origin  and  has  been  drawn  in  Figure  3.5.  With  Equation  (3.36) 
holding  for  u,  then 


X2(t)  = vt  + §2 


(3.39) 


and,  so  at  t = t^. 


vtf  + §2 


(3. AO) 


The  time  of  impact  depends  on  v.  An  infinite  number  of  such 
controllers  is  available  to  steer  the  system  to  the  origin. 
Similar  remarks  would  apply  to  the  other  two  complementary 
regions  which  lie  below  the  switching  curve  AOB. 


3.3.3  The  Minimum  Energy  Problem 

The  form  of  the  optimal  controller  is  given  for 
i = l,...m  by  (see  Appendix  A for  a derivation) 

-1.0  if  p (t)bi(x(t)  ,t)  > 2ri;l 

u*i(t)  = ( -p  (t)b.j  (x(t)  ,t)  if  -2rii  < p (t)b1(x(t)  ,t)  < 2rii(3.Al) 


— T _ _ 

if  p (t)b.j_(x(t)  ,t)  <_ -2rii 


Note  that  the  minimum  energy  controller  can  assume  all  values  between 
-l  and  +1;  that  is,  it  is  a continuous  controller  whereas  the  minimum 
fuel  and  minimum  time  controllers  were  discontinuous. 

For  some  initial  states  in  our  two  state  example,  an  optimal 
controller  will  exist;  but,  for  others  it  will  not.  It  will  be  shown, 
however,  that  all  initial  states  can  be  steered  to  the  origin. 

Our  controller  u*  is  given  by 


u*  (t)  = 


-1.0 

if 

P2(t)  2 

-p?(t) 

2 

if 

-2  < p2(t)  < 2 

(3.42) 

+ 1.0 

if 

p?(t)  <_  -2  . 

The  Hamiltonian  is  given  by 

H = u2(t)  + p.(t)x?(t)  + p~(t)u(t)  . 


Note  that  the  state  and  costate  equations  remain  unchanged.  Hence, 
Pl(t)  = ir1  and  p9(t)  = — tt  1 1 + * 

Rather  than  list  all  possible  control  sequences,  let  us 
examine  the  transversality  condition  (Equation  (3.8))  and  its 
consequences  to  narrow  the  list.  The  transversality  condition  is 
that 


(3.43) 


(3.44) 


(3.45) 


H(tf)  = 0 = u (tf)  + Pi(tf)V(tf)  + p2(tf)u(tf)  . 


u(tf)  = 0 or  u(tf)  = -P2(tf) 


In  the  latter  case,  since  the  control  constraint 


|u(t)  < 1 


must  be  met,  it  follows  that 

|P2(tf)l  1 1 

From  the  form  of  the  optimal  control.  Equation  (3.42),  we  also 
have  that 

u(tf)  = -p2(tf)/2  . 

Hence,  from  Equations  (3.49)  and  (3.46),  it  follows  that 

P2(tf)  = 0 , 


which  implies  that 


u(t  ) = 0 


(3.46) 


(3.47) 


(3.48) 


(3.49) 


(3.50) 


(3.51) 


Hence,  in  either  case  of  Equation  (3.46),  we  have 
that  Equation  (3.51)  holds  and  that  consequently 

P9(tf)  = 0 . 

Using  Equation  (3.44),  combined  with  Equation  (3.52),  we  conclude 
that  tt^  and  tt9  have  the  same  sign.  This  last  fact  when  combined 
with  Equation  (3.52)  limits  the  possible  graphs  of  p9(t)  vs.  t to 
those  shown  in  Figure  3.6. 

p2<o  , 

JXf 


(3.52) 


Figure  3.6.  p2(t)  vs.  t 


l 


Figure  3.6  in  conjunction  with  the  form  of  the  optimal 
control  obtained  using  Pontryagin's  minimum  principle  (Equation  3.42) 
guarantees  that  there  are  only  three  possible  optional  control 
sequences : 


{-  p2(t) },  {A,-  P2(t)},  where  A = ± 1 
2 2 


Case  I:  The  control  sequence  P2(t)-> 

2 


We  will  obtain  the  locus  of  all  points  that  can  be 
steered  to  the  origin  with  control  {-  P2(t)/2}.  Since  the  final 
time  tf  is  free,  the  Hamiltonian  will  be  identically  zero  along 


(3.53) 


an  extremal  trajectory, 
are : 


Several  equations  that  will  be  needed 


H(0)  =0  ^2  = 


H(tf)  = 0 
|p2(t)|  < 2 
x 2 ( t ^ ) = 0 
X£(tf)  = 0 


71 1 1 f v2 


l7I2l  1 2 


Tl  9 TTt 

0 = - 2 tf  + x2(0) 

Tfl  2 TTp  2 

0 = -j-jtf  - ^-tf  + x2(0)tf  + x^(  0) . 


(3.54) 

(3.55) 

(3.56) 

(3.57) 

(3.58) 


Several  substitutions  will  be  required  to  obtain 
inequalities  involving  only  x^O),  x2(0).  From  Equations  (3.57) 
and  (3.55),  we  obtain 

tt  0 1 1-  = 4x9(0)  . 

Using  Equations  (3.58)  and  (3.55),  we  have  tiiat 

2 


2 f = x2(0)tf  + x ^ (0)  . 


V 

6 

Substituting  Equation  (3.59)  into  (3.60)  yields 


-jX2(0)tf  = x2(0)tf  + xj(0) 


- 3xx(0) 

= x2(U) 


Substituting  Equation  (3.62)  into  (1.59)  yields 

r2  = - 4 x22(0) 


3 xx(0) 


(3.59) 


(3.60) 


(3.61) 


(3.62) 


(3.63) 
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Placing  this  result  and  Equation  (3.62)  into  Equation  (3.33), 
we  obtain  „ 

4 *2  (0) 

*1  = 9 xp(0) 

Finally,  using  Equation  (3.63)  with  Equation  (3.56)  gives  us 
the  following  inequalities: 

x2  > -V3x  2 or  x2  £ /-3x^/2  . 

Sign  considerations  for  Case  I_ : 

Equation  (3.55)  ->  sgn^^)  = sgn(ir2) 

Equation  (3.63)  -*•  sgn(-rr2)  = -sgn(x^(0)) 

Equation  (3.64)  ■>  sgn(n^)  = sgn(x2(0))  • 

Consequently , 

sgn(x2(0))  = -sgn(x^ (0) ) , (3.66) 

and  we  conclude  that  we  are  confined  to  quadrants  II  and  IV. 

Shape  of  optimal  control  traj  ectories  for  Case  _I : 

It  can  be  shown  that  x2(t)/ir^  is  a monotone,  decreasing 
function  bounded  below  by  zero.  Also,  x^(t)/7r  is  a monotone, 
increasing  function  bounded  above  by  zero.  From  the  implied 
relationship 


(3.64) 


(3.65) 


X2  = ’ 


(3.67) 


it  can  be  established  that  f(x^)  is  concave  upward  in  quadrant  II, 
and  concave  downward  in  quadrant  IV.  Ijji  summary,  any  initial  starting 
point  (x^(0),x9(0)  in  the  region  Rq  U Rq  where 


— 2 3 

R~  = {(x1,x2)|  (x,,x2)  lies  below  the  curve  x.;  = sgnte^)^*^ 

and  x2  > 0} 


(3.68) 


and 


R^  = {(x^,x2)|  (x^,x2)  lies  above  the  curve  x2  = sgn(x^)^x^ 
and  x2  < 0}  , 


(3.69) 


can  be  driven  to  the  origin  in  an  optimal  fashion  with 


u(t)  = - 


P2U) 


(3.70) 


I 


'' 

I 


a 
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Equations  (3.63)  and  (3.64)  give  the  value  of  the  constants 


Let  t^  denote  the  time  when  the  control  changes  from 
A to  -p2(t)/2.  For  0 < t <_  t^,  we  integrate  the  state  equations 
to  obtain 


x2(t)  = At  + x2(0) 


Xj  (t)  = Ay  + x2(0)t  + x^  (0) 


Consequently,  we  can  eliminate  t from  these  two  equations  and 
obtain 


Xj^  = |-(Ax2  - Ax2(0))2  + x2(0)(Ax2  - Ax2  (0) ) + x^(0) 


X1  ~ xl(0)  = f(x2  ' x2(0))  ' 


Equation  (3.74)  is  a parabola  in  the  state  space  with  vertex 


xx  = xx(0)  - |x2(0) 


x2  = 0 . 


From  Equation  (3.74),  the  parabolas  passing  through 
the  origin  are  _ 


""V 


When  A = +1,  we  can  see  from  Equations  (3.71),  (3.72)  that 

x2(t)  + x2 ( t-^)  and  so  x2(0)  < 0 . (3.78) 

Similarly,  when  A = -1,  then 

x 2 ( t ) + x9(t1)  and  so  x2(0)  > 0 . (3.79) 

To  summarize,  the  state  space  consists  of  three 
major  regions:  those  points  that  can  be  brought  to  the  origin 
with  u = {-p2(t)/2} (R  ) ; those  points  that  can  be  brought  to 
the  origin  with  the  control  switching  sequence  u = { A ,-p2 ( t) / 2 } (R^) ; 
and  all  other  points  for  which  there  does  not  exist  an  optimal 
control  (R„) . Figure  3.8  shows  various  trajectories  for  these 
regions.  The  definitions  of  these  regions  follows  the  figure. 


Figure  3.8.  Various  Optimal  Control  Trajectories 


Although  regions  RQ  and  R+  were  defined  previously,  their  definitions 
are  repeated  with  those  of  the  new  regions  for  the  sake  of  completeness. 


; { ( x ^ , x2 ) I 0 < x2  <_  ^-3x^/2 } 


Rq  H {(x1,x2)|  0 > X,  > -/3Xj / 2 ) 


Ro  s Ro  u Ro 


(3.80) 

(3.81) 
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(3.82) 


R1  = {(x1,x2)|  /-3x^/2  < x2  < /-2x1J 


r|  = {(x1,x2)|  -/-3x^/2  > x2  > -/2x1  (3.83) 


R2  = R~  u R+ 

R2  = ^ (xi >x2^ 1 x2  1 

0 and  (x  ,x2)  i Rq  b R^ 

(3.84) 

R2  = ^xi’x2)  1 x2  — 

0 and  (x^,x2)  j U R^} 

(3.85) 

R~  = {(x1,x2) | 

x2  = /-2x^} 

(3.86) 

R+  = { (x1,x2) | 

x2  = -/2x~}  • 

(3.87) 

The  optimal  solutions  to  the  minimum  energy  problem 
are  restricted  to  Rq  U R^.  Sub-optimal  solutions  may  be  used 
in  the+rest  of  the  state  space:  {A}  on  R UR  and  (-A,-%>2(t) } on 
R2  U Ro.  Note  that  the  switching  of  the  control  u from  -A  to 
-p2(t)/2  must  occur  some  arbitrarily  small  distance  inside  the 
region  R^  and  not  on  the  x^  axis. 

3.4  Some  Remarks  about  the  Three  Types  of  Control  Problems 

First,  it  should  be  noted  that  there  are  other  types  of 
control  problems  than  the  three  mentioned  in  section  3.3.  These 
are  of  interest  to  us  because  we  can  formulate  our  missile  control 
problem  in  any  of  the  three  forms  (see  Chapter  IV) . 

Some  observations  are  in  order  about  the  three  control 
problems.  In  terms  of  difficulty,  the  minimum  time  problem  is  the 
easiest  to  solve,  the  minimum  fuel  would  be  next,  and  the  minimum 
energy  last.  This  is  due  mainly  to  the  general  form  of  the 
controller  for  each  case.  More  important,  however,  is  the  form  of 
the  controller  as  it  pertains  to  the  problem  of  implementation. 

The  minimum  time  controller  is  a bang-bang  type  (±1),  and 
the  minimum  fuel  is  a bang-off-bang  controller  (Q,±l) . The  minimum 
energy  one  can  take  on  all  values  between  +1  and  -1.  This  controller 
is  continuous,  whereas  the  other  two  are  not.  From  an  implementation 
point  of  view,  the  minimum  energy  controller  would  be  preferable 
since  the  input  to  the  system  would  be  changing  in  a less  drastic 
manner.  Furthermore,  in  a system  that  identifies  the  control  with 
tail  fin  deflection,-  instantaneous  changes  are  physically  unrealizable. 
Because  of  the  continuity  of  the  controller,  one  would  expect  the 
implemented  version  of  this  control  to  closer  approximate  the 
idealized  version  than  would  be  the  case  with  the  other  two.  The 
tradeoff,  of  course,  is  the  additional  mathematical  complexity  in 
obtaining  the  solution  to  the  control  problem  in  the  first  place. 


*1 
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Chapter  IV 

THE  MLSSILE  CONTROL  PROBLEM  WITH  CONTROL  CONSTRAINTS 

4.1  Missile  - Target  Geometry  and  Dynamics 

Our  objective  is  to  control  a missile  in  such  a way  as  to 
impact  on  the  target  with  the  center  of  the  missile  being  vertical; 
that  is,  we  want  not  only  to  insure  that  we  hit  the  target,  but  we 
want  also  to  impose  conditions  of  the  missile's  attitude  at  imDact. 
Figure  4.1  depicts  the  geometry  of  the  terminal  guidance  phase, 
and  Table  4.1  defines  the  variables  introduced. 


Table  4.1.  Definition  of  Variables 


Variable 


Def inition 


Missile  position  variable  projected  on  the  ground  (ft) 
Target  position  variable  (ft) 

Position  variable  from  missile  to  target  projected  on 

the  ground  (Y.  = Y -Y  ) 
d t m 

Time  derivative  of  Y.  (ft/sec) 

d 2 
Lateral  acceleration  of  the  missile  (ft/sec  ) 


0 


Body  attitude  angle  of  the  missile  (deg) 


The  following  set  of  state  variables  was  chosen  for  the 
modeling  process. 


(4.1) 


Under  the  assumption  that  the  angle  of  attack  a is  small  and  can 
thus  be  neglected,  the  state  variables  are  subject  to  the  following 
dynamics : 


Y,  = Y, 
d a 


Y , = -A  cos  6 
d L 


Al  = -wxAl  + KjU 


0 = K u . 

a 

Performance  is  considered  to  be  acceptable  when  the  following 
constraints  are  satisfied: 

lYd(tf) I 1 5 feet 


| 0 ( t f ) | <_  5 degrees 


4.2  Formulations  of  the  Missile  Control  Problem 

We  will  impose  the  normalized  control  constraints 
|u(t) | < 1 

on  our  problem.  In  order  to  facilitate  our  goal  of  obtaining 
a closed  form  solution,  we  will  add  an  additional  assumption  that 
will  reduce  the  number  of  state  variables  to  three.  To  this  end, 
we  assume  that  the  autopilot  has  zero  lag.  This  means  that 
Equation  (4.4)  can  he  omitted,  and  that  we  can  rewrite  Equation 
(4.3)  by  using  the  relationship 


A = — u . 

L Wi 

We  linearize  our  differential  equation  (4.3)  about  the  impact 
conditions , 

b = cos  0 , 


(4.2) 


(4.3) 


(4.4) 


(4.5) 


(4.6) 


(4.7) 


(4.8) 


(4.9) 


(4.10) 
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and  obtain  the  following  three  state  system  dynamics: 


Yd 

-bKx 

ll 

W. 

(4.11) 

(4.12) 

1 

K a . 
a 

(4.13) 

4.2.1  Minimum  Time  Problem 

Find  a controller  u,  satisfying 

|u(t) | < M , (4.14) 


that  will  drive  the  system  (4.11),  (4.12), 
and  will  minimize 


1 dt 


(4.13)  to  the  origin 


(4.15) 


At  first  glance,  one  might  think  that  our  control 
problem  as  outlined  in  section  4.1  cannot  be  viewed  as  a 
minimum  time  problem,  but  that  is  in  fact  incorrect.  The  key 
lies  in  the  selection  of  state  variables.  Minimum  time  in  this 
context  means  far  more  than  getting  to  the  target  in  the  shortest 
possible  time.  The  end  condition 

Y,(t,)  = 0 (4.16) 

d t 

means  that  the  miss  distance  at  impact  is  to  be  zero.  The  end 
condition  for  the  second  state  variable 

Yd(tf)  = 0 (4.17) 

implies  that  the  velocity  vector  is  to  be  directed  straight 
downward.  Finally,  the  last  state's  end  condition 

0 ( t f ) = 0 (4.18) 


insures  that  the  center  line  of  the  missile  is  perpendicular  to 
the  horizon  upon  impact. 

This  particular  way  of  viewing  our  control  problem  as 
one  with  final  time  t^.  unspecified  possesses  an  especially  desirable 
feature  from  an  implementation  point  of  view.  It  is  very  difficult 
to  obtain  an  accurate  estimate  of  final  time  with  our  lack  of  range, 
range-rate  information.  A control  law  in  which  control  changes  are 
made  depending  only  on  state  measurements  and  not  on  time-to-go  is 
preferable . 
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4.2.2  Minimum  Fuel  Problem 

The  dynamics  remain  unchanged.  The  control  problem  then 
is  to  drive  the  system  to  the  origin  with  a controller  that  will 
minimize 

J = J0  |u(t) | dt  . (4.19) 

This  controller  would  be  somewhat  more  desirable  than 
the  minimum  time  controller  since  it  has  the  added  'coasting' 
command.  This  controller  too  is  time  invariant,  and  the  system 
would  be  driven  by  state  measurements. 


4.2.3  Minimum  Energy  Problem 


Again,  the  dynamics  remain  unchanged.  The  control 
problem  is  to  drive  the  system  to  the  origin  with  a controller 
that  will  minimize 


(4.20) 


This  controller  would  yield  the  desired  type  of 
trajectory  and  wouid  be  the  most  desirable  since  for  optimal 
trajectories  it  changes  in  a continuous  fashion.  It  too  would 
bf  time  invariant,  and  state  measurements  only  would  drive  the 
system. 


4.3  Solution  of  the  Missile  Control  Problem  as  a Minimum 

Time  Problem 


There  are  two  important  concerns  to  investigate  before 
trying  to  solve  the  minimum  time  control  problem  with  dynamics 


and  state  vector 


Yd  " Yd 


(4.21) 


-bK 

Y , = — u 

d 


K u 
a 


u < M 


(4.22) 

(4.23) 


X . [vd,Yd,ei 


(4.24) 


First,  check  to  see  if  the  system  is  controllable,  and, 
secondly,  check  to  see  if  the  matrix  A is  in  Jordan  cannonical 
form  where  the  system  is  viewed  as 


X = AX  + Bu 


(4.25) 


4.3.1  Controllability 


A system  of  the  form  given  in  Equation  (4.25)  where 
X has  n states  is  controllable  if  and  only  if 

rank  [B ,AB ,A^B, . . . A°  *B]  = n 


(4.26) 


For  our  system,  we  have  that 


0 

1 

0 

0 

A = 

0 

0 

0 

and 

B = 

-a 

0 

0 

J 

k 

where 


a = K^b/w^  and  k 


K 

a 


2 

Calculating  the  matrix  [B,AB,A"Bj,  we  obtain 


M = 


0 -a  0 
-a  0 0 

k 0 0 


(4.27) 


(4.28) 


(4.29) 


which  has  rank  2.  We  conclude  then  that  our  particular  system 
is  uncontrollable  in  the  sense  that  it  will  not  be  possible  to  steer 
an  arbitrary  point  in  x^-x„-x^  space  to  the  origin.  Since  it  is 
more  important,  however,  that  x^  and  x^  be  driven  to  zero  than  x„, 
we  will  continue  with  the  solution  to  investigate  this  possibility. 


4.3.2  Jordan  Cannonical  Form  of  A 

When  necessary  a change  of  variables  whould  be  introduced 
to  insure  that  A is  in  Jordan  cannonical  form  as  this  will  later 
simplifv  the  equations  of  various  switching  curves.  Our  matrix  A 
has  an  eigenvalue  0,  of  multiplici tly  three.  The  characteristic 
polynomial  A(t)  of  A is 

A ( t)  H det [ tl  - A]  = t3  , (4.30) 

and  the  minimum  polynomial  m(t)  of  A is 

m(t)  = t2  . (4.31) 

The  Jordan  cannonical  form  is  a block  diagonal  matrix 

with  blocks  J. . of  the  form 
ij 
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w 


0 


1J 


A . 1 . 

1 

0 A . . 

l 


0 

0 0 


(4.32) 


G 0 


0 


. . A . 1 

i 

. . OX. 


Since  there  is  at  least  one  J..  of  order  2 (the  power  of  (t-A.) 
in  the  minimum  polynomial),  and  the  sum  ol  the  orders  of  J..'s 
is  equal  to  3 (the  power  of  (t-A.)  in  the  characteristic 
polynomial),  there  can  be  only  one  possible  form  for  the 
cannonical  form  of  the  matrix  A,  namely. 


0 

i 

0 

J = 

0 

0 

0 

0 

0 

0 

already 

in 

Jordan 

4.3.3  An  Altered  Minimum  Time  Problem 

Section  4.3.1  showed  that  our  system  was  not  controllable 
in  the  sense  that  for  an  arbitrary  initial  value  (xyQ’x20’X30^  we 
could  not  drive  all  three  states  to  zero.  Since  the  rant  of  the 
matrix  M was  2,  we  should  be  able  to  drive  two  of  our  state  variables 
to  zero.  Returning  to  the  physical  meaning  of  our  states,  it  is 
clear  that  it  is  most  important  that  the  miss  distance  (x.)  and  the 
attitude  angle  (x^)  be  zero  at  final  time.  A non- zero  value  for 
X2(t)_)  can  be  tolerated,  and  in  fact,  there  will  be  some  best  possible 
value  c for  k^. 

Control  Problem : Find  a controller  v(t)  to  steer  the  system 


*1  = X2 

• 

x„  = —bv 

* Wl 


(4.33) 

(4.34) 


to 


X.  = K v 

3 a 


lx(tf)]T  = [0,c ,0]T 


in  the  shortest  possible  time  subject  to  the  constraint 

| v ( t ) ] < M . 


(4.35) 


(4.36) 


(4.37) 
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Change  of  Variable  to  Normalize  the  Controller:  Define 


K,b  M 

v 1 . 

u = — , a = , k = h M . 

M wx  a 


Then  the  system's  new  state  equations  are 


X1  = X2 


= -au 


x^  = ku 


where  u must  satisfy  the  constraint 

|u(t)|  < 1 . 

Determination  of  c:  From  Equations  (4.40)  and  (4.41), 
kx^  + ax j = -aku  + aku  = 0 . 

Integrating,  we  get 

k(x2(t)  - x2(0))  + a(x.3(t)  - x^(0) ) = 0 . 

Letting  t = t^  we  see  that  when  x^  is  driven  to  zero, 

kx„(0)  + ax_(0) 

x2(tf)  = ~ k " H c • 

Solut ion  of  the  State  Equat ions : Assuming  that  u = A = ±1, 
the  solution  of  the  state  equations  is 


(4.38) 


(4.39) 


(4.40) 


(4.41) 


(4.42) 


(4.43) 


(4.44) 


(4.45) 


xx  = + x2(0)t  + x1(0) 


x2  = -aAt  + x2(0) 
x^  = kAt  + x.j(0)  . 


(4.46) 

(4.47) 

(4.48) 


Projection  of  the  Swi tchlng  Curves : Solving  Equation  (4.47) 
for  t and  substituting  into  Equation  (4.46)  yields 

x1  - x1(0)  = -|^(x2  - x2(0))  . 


(4.49) 


From  Equation  (4.44),  and  the  definition  of  c,  we  have 


kx2  + ax^  = kc  . 


(4.50) 
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Since  we  would  like  to  investigate  the  projection  of  the  state 
trajectories  in  the  - x^  plane,  we  evaluate  Equation  (A. 49) 
at  t = t and  make  a substitution  relating  x„  to  x~.  From 
(4.49) , since 

x^t^)  = 0 and  x (t  .)  = e . 

we  have  that 

*1<°>  ’ tl<'2  - *2<«»  ' 

From  Equation  (4.50)  at  t = 0,  we  obtain 

kc  - ax„(0) 

x2(0)  = ~ . 

Substituting  into  Equation  (4.52),  we  have  the  desired  result 


x/O)  = ~ -2(a2x^(Q)  - 2akcx^(0) ) . 

2ak 

The  shape  of  the  locus  of  initial  points  in  the  - x^  plane 
that  can  be  driven  to  the  origin  is  a parabola. 

Qualitative  Discussion : From  Equations  (4.47),  (4.48), 
we  see  that 


and 


A = +1  implies  that  x2+  and  x^t 


A = -1  implies  that  x2+  and  x^l  . 


From  Equation  (4.54),  we  also  note  that 

A = +1  implies  that  the  parabola  opens  left 


and 


A = -1  implies  that  the  parabola  opens  right  . 

2 

The  x^-intercepts  of  the  parabola  are  0 and  — — . 

Graphs : Figure  4.2  shows  trajectories  (projections)  in  the 

X2  ~ X3  P^ane>  while  Figures  4.3a  and  4.3b  show  the  switching 
curve  in  the  x^  - x^  plane. 
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(4.51) 

(4.52) 

(4.53) 

(4.54) 

(4.55) 

(4.56) 

(4.57) 

(4.58) 


The  Control  Sequence  ( -A  , A } : First,  calculate  c,  which  depends 
on  X2(0),  x^(0).  In  Figure  4.1a,  if  the  initial  point  lies  above 
the  switching  curve,  apply  u = -1  until  the  switching  curve  is 
intercepted,  and  then  change  to  u = +1.  The  opposite  would  be  true 
if  the  initial  point  lies  below  the  switching  curve.  The  same 
comments  apply  to  the  switching  curve  for  c < 0 given  in  Figure  4.3b. 
Several  trajectories  going  to  the  origin  have  been  shown  in  Figure  4.4. 


Figure  4.4.  Various  Optimal  Control  Trajectories 

Conclusions  and  Remarks : Figure  4.4  shows  that  it  is  mathematically 
possible  to  take  any  initial  point  in  x^-x^-x  space  and  drive  x and 
x^  to  zero,  x^  to  c in  an  optimal  manner.  Although  mathematically 
correct,  this  turns  out  to  be  physically  impossible  since  the  mathematical 
model  used  was  not  stated  as  precisely  as  it  should  have  been.  The 
error  is  that  there  are  inequality  constraints  on  the  state  variables 
which  have  been  overlooked.  The  switching  curves  in  Figure  4.3  are 
not  actual  trajectories  since  the  missile  will  flip  when  x changes 
sign  to  a negative  value.  A sign  change  in  x^  means  that  the  missile 
has  overflown  the  target. 

In  order  to  make  the  model  more  closely  represent  the  physics 
of  the  situation,  these  inequality  constraints  must  be  incorporated  into 
the  model.  According  to  the  definition,  we  must  have  x^  and  x^  positive 
throughout  the  flight,  and  X2  negative. 


Chapter  V 


COMPUTER  SIMULATION  IMPLEMENTATION 


5.1  Effect  of  the  State  Inequality  Constraints 

As  was  mentioned  at  the  end  of  Chapter  IV,  although  it 
was  possible  to  drive  miss  distance  x^  and  attitude  angle  to 
zero  in  the  imaginary  world  of  mathematics,  the  physical  world 
imposes  constraints  that  will  prevent  this  in  practice.  Reality 
imposes  the  following  constraints  on  our  state  variables: 

(5.1) 

(5.2) 

(5.3) 

Given  the  initial  value  of  velocity,  xo(0),  and  missile 
attitude,  x^(0),  a value  for  the  constant  c is  calculated  where 
c is  defined  by 

kx„(0)  + ax„(0) 

c= j—3 . (5.4) 

If  c is  non-negative,  then  the  switching  curve  given  in  Figure  4.3a 
applies;  whereas,  if  c is  negative,  then  Figure  4.3b  applies. 

Should  c be  negative,  then  for  certain  initial  conditions,  it  will 
be  possible  to  drive  both  x.  and  x~  to  zero.  However,  should  c 
be  positive,  then  we  can  drive  x^  to  zero,  but  there  will  be  some 
non-zero  best  possible  value  for  x^  at  t^.. 


Xj  > 0 

x2  1 0 


x3  - ^ * 


5.1.1  Case  I.  c < 0 

The  constraints.  Equations  (5.1),  (5.2),  (5.3)  force  an 
admissable  trajectory  to  lie  entirely  in  the  first  quadrant  when 
the  trajectory  projection  in  the  x^  - x^  plane  is  considered. 

Should  the  initial  state  lie  above  the  switching  curve,  point  A for 
example,  then  it  is  not  possible  to  drive  x„  to  zero  at  impact  as 
there  is  insufficient  time  for  the  missile  to  react.  The  best  that 
can  be  done  is  to  give  a hard  over  tail  fin  command  to  bring  the  nose 
of  the  missile  down  (trajectory  AE  in  Figure  5.1). 

Should  the  initial  state  lie  on  the  switching  curve,  most 
unlikely,  a command  of  {-1}  will  bring  both  states  to  zero.  With  the 
initial  condition  lying  below  the  switching  curve,  point  B for  example, 
then  a {+1,-1}  sequence  will  bring  x^  and  x to  zero  most  rapidly 
(trajectory  BCO  in  Figure  5.1).  It  should  be  kept  in  mind,  however, 
that  since  our  math  model  is  a rather  simple  reflection  of  reality, 
perhaps  a better  control  strategy  might  be  the  sequence  {0,-1}. 

It  too  will  drive  both  variables  to  zero.  A computer  simulation  that 
is  more  realistic  than  the  three-state  system  is  needed  to  decide 
questions  of  this  type. 


Figure  5.1.  Various  Trajectories  for  Case  1 


5.1.2  Case  II.  c > 0 

When  c > 0,  it  is  not  possible  to  drive  both  x^  and 
to  zero  at  t . In  a similar  fashion,  if  the  initial  point  lies 
above  the  switching  curve,  a hard  over  command  is  the  best  that 
can  be  done.  If  the  initial  state  lies  on  the  switching  curve, 
then  the  upper  x^- intercept  is  the  best  possible  value  that  can 
be  achieved  for  x^  with  control  {-1}.  Similar  concerns  as  before 
apply  to  those  initial  values  below  the  curve.  Is  the  control 
sequence  {+1,-1}  or  {0,-1}  best,  or  perhaps  u = v where  v is  a 
negative  constant  between  0 and  -1?  Again  a simulation  is  needed 
to  decide. 

5.1.3  Physical  Meaning  of  Case  I and  II 

Combining  the  definition  of  c given  by  Equation  (5.4) 
with  the  constraints  on  x^,  and  x^,  we  can  derive  an  inequality 

relating  x (0)  to  x^(0)  for  both  cases.  Since  k is  positive, 

Equation  (5.4)  implies  that 

kx2(0)  + ax^ (0)  < 0 (5.5) 

or 

x2(°)  < -fx3(0)  . (5.6) 

Since 

*3(t)  2.0  and  x2(t)  <_  0 , (5.7) 

it  follows  that  for  c < 0, 

x2(0)  < -f*3(0)  1 0 • (5-8) 
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Similarly,  for  c > 0, 


< x2(0) 


< 0 


(5.9) 


Returning  to  the  physical  meaning  of  our  state  variables, 
this  would  say  that  it  is  possible  to  determine  whether  or  not  the 
desired  trajectory  can  be  achieved  by  comparing  missile  velocity, 
x~(0),  to  a multiple  of  missile  attitude,  -ax„(0)/k.  If  Equation 
(5.9)  holds,  the  missile  velocity  is  not  sufficiently  large  in 
magnitude  to  achieve  the  stated  objective.  If  Equation  (5.8) 
holds,  the  missile  velocity  is  sufficiently  large  to  drive  both 
variables  to  zero. 


5.2 


Implementation  of  the  Controller  for  the  Three 
State  System 


5.2.1  The  Linear  and  Non-linear  Three  State  System 


The  following  procedure  was  adopted  in  trying  to  implement 
our  control  strategy  with  the  three  state  system.  First,  our  system 
in  the  computer  simulation  was  kept  identical  to  the  normalized 
linear  system  given  by  Equations  (4.39),  (4.40),  (4.41).  Then  this 
system  was  replaced  by  the  non-linear  system 


X1  x2 


x„  = T-cosO-u 
2 b 


x^  = ku 


< 1 


(5.10) 

(5.11) 

(5.12) 


which  is  a normalized  version  of  the  original  non-linear  three 
state  system  given  by 


Y = Y 
d d 


Y = ” — cos©*v 
d w^ 


(5.13) 

(5.14) 


K v 
a 


(5.15) 


The  control  value  for  u depends  on  whether  or  not  the 
point  (x^,x_)  lies  below  or  above  the  corresponding  switching 
curve  (c  >_  0 or  c < 0) . The  implementation  uses  an  altered  form 
of  Equation  (4.54).  Cancelling  the  a on  the  right  hand  side  of 
(4.54),  we  obtain  for  the  equation  of  the  switching  curve 


(0)  = ~-T(ax^(0)  - 2kcx.  (0) ) 
2kz  J J 


(5.16) 


. 


A 
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Note  that  this  one  equation  represents  both  cases  since  c is 
included.  A point  (x-,,x-,)  will  thus  lie  below  the  switching 
curve  (see  Figure  5.1;  if 

x (0)  > ---A-„(ax2(0)  - 2kcx  (0))  (5.17) 

2k  J 

or 

x (0)  + -AT(ax2(0)  - 2kcx  (0))  > 0 . (5.18) 

2kZ 

Equation  (5.18)  was  used  in  the  simulation. 


5.2.2  Selection  of  Control  Strategy 

The  problem  of  selecting  a control  strategy  could 
not  be  pursued  at  this  stage  for  reasons  which  will  shortly 
become  apparent.  The  computer  implementation  as  mentioned 
above  (see  Appendix  B for  a listing)  was  executed  with  the 
additional  feature  of  updating  c every  one  tenth  of  a second 
throughout  the  flight  for  greater  accuracy.  Some  difficulty 
was  experienced  in  evaluating  K which  appears  in  the  differ- 
ential equation  for  9.  The  simulation  would  drive  both  9 and 
Y^  to  zero  at  the  same  time,  but  it  would  have  no  way  of 
detecting  the  time  of  impact.  On  some  runs,  the  missile  would 
overfly  the  target  with  9 achieving  zero  as  it  did  so. 
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Chapter  VI 


CONCLUSIONS  AND  RECOMMENDATIONS  FOR  FUTURE  STUDY 


The  implemented  control  law  in  section  5.2.2  did  not 
accomplish  what  was  intended.  Although  it  did  drive  0 and  Y^ 
to  zero  at  the  same  time,  this  was  not  always  the  time  at  impact. 

With  a little  reflection,  one  realizes  that  this  is  all  the  control 
law  was  designed  to  do.  The  concept  of  'above'  or  'below'  the 
ground  does  not  appear  in  the  formulation  of  the  problem.  It 
would  seem  that  this  can  be  overcome  by  electing  to  drive  Y^  and 
Y to  zero  at  t ^ . This  selection  would  shape  the  trajectory  so 
tnat  the  missile  would  have  a high  attitude  angle  at  impact. 

This  approach  is  being  pursued  and  results  of  it  will  be  communi- 
cated to  the  contract  supervisor.  Had  there  been  more  access  to 
the  computer,  perhaps  this  difficulty  might  have  been  discovered 
sooner  and  corrected.  We  conclude  with  some  recommendations  for 
future  study. 

The  more  accurately  the  mathematical  control  model  reflects 
the  actual  system,  the  better  the  chances  of  getting  the  corresponding 
control  law  to  work.  For  our  missile  problem,  the  most  accurate 
representation  of  the  actual  system  is  our  6D0F  computer  simulation 
which  contains  at  least  thirty  to  forty  coupled,  first  order,  ordinary 
differential  equations.  To  produce  a simpler  system  for  which  a 
closed  form  mathematical  solution  of  the  control  problem  might  be 
feasible,  a selection  of  the  more  important  states  must  be  made.  For 
us,  this  next  simpler  model  was  a four  state  system  using  the  state 
variables  Y^,  Y , 0,  and  A . We  decided  that  rather  than  try  to 
solve  this  problem,  we  would  simplify  the  system  representation  even 
further  to  a three  state  one  for  which  we  could  obtain  a control  law. 

The  assumption  that  was  made  to  produce  this  last  model  was  that  the 
lag  in  the  autopilot  could  be  ignored. 

Because  of  the  form  of  the  resulting  system,  our  control 
problem  turned  out  to  be  not  completely  controllable.  This  fact 
simplified  the  mathematics  considerably,  and  a control  strategy  was 
devised  which  would  drive  the  miss  distance  Y,  to  zero  along  with  the 
attitude  angle  0 at  impact.  The  other  variable,  Y^,  could  not  be 
driven  to  zero  at  final  time  except  for  rather  special  sets  of  conditions 
Due  to  this  controllability  problem,  our  three  state  system  was  effec- 
tively reduced  to  a two  state  one. 

Based  upon  our  experiences,  we  would  recommend  that  there 
might  be  some  merit  in  working  with  a control  for  the  four  state 
system.  Generally,  such  mathematical  solutions  are  almost  impossible 
to  obtain,  but  if  this  system  has  some  of  the  controllability  features 
of  the  three  state  one,  then  the  resulting  control  problem  might  very 
well  turn  out  to  be  like  a three  state  system  that  is  completely 
controllable.  A closed  form  solution  might  well  be  obtained  for  such 
a system. 


; 
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Remark:  Of  the  three  books  on  optimal  control  theory,  the  text  by 
Kirk  in  the  Prentice-Hall's  Electrical  Engineering  Series  is  the  most 
readable.  The  book  by  Athens  and  Falb  and  the  book  by  Bryson  and  Ho  are 
more  rigorous  than  Kirk's,  with  the  Bryson-Ho  book  being  probably  the 
most  difficult  text  to  read.  It  has  numerous,  interesting  applied  control 
examples  however.  The  Athens-Falb  book  is  almost  encyclopedic  in  nature, 
consisting  of  about  900  pages. 


Appendix  A:  Derivation  o£  the  Minimum  Energy 


Optimal  Control  Law 


Problem : Determine  the  form  of  the  optimal  control  for 
the  system 


x ( t ) = a (x(t) , t)  + B(x(t) ,t)u(t) 


in  order  to  drive  the  system  to  the  origin  so  as  to  minimize 
the  functional 


tf_T 

= ft  u (t)Ru(t)  dt  , 
o 


where  t.  is  free,  R a diagonal  matrix  with  positive  elements, 
Assume  that 


|ui(t)  I <_  1 , i = 1, . . .m  . 


Solution:  Define  the  Hamiltonian  H to  be 

T 


jr  _t  

H = u (t)Ru(t)  + p (t)a(x(t),t)  + p (t)B(x(t)  ,t)u(t) . 


If  the  control  constraints  are  not  violated,  then 

T 


SH 


0 = — = 2Ru(t)  + B (x(t),t)p(t)  . 


Equation  (A-5)  implies  that 


,-l 


u(t)  = R B (x(t) ,t)p(t)/2  . 


. th 


To^jJetermine  the  i component  of  u(t),  let  b^ 
denote  the  i column  of  the  matrix  B . Then 

T T T 


, th 


the  i component  of  B (x(t),t)p(t)  = b^p(t)  = p (t)b^  . 


The  argument  has  been  omitted,  but  b.  is  a function  of  x(t),t. 
Hence,  from  Equation  (A-6) , we  have  £hat 


_T 

P (t)b. 


Ui(t)  = " “27 


ii 


Since  Equation  (A-3)  holds,  Equation  (A-8)  holds  for 


-2r ..  < p (t)b.  < 2r.. 

ii  l ii 


A-l 


(A-l) 


(A-2) 


(A-3) 


(A-4) 


(A-5) 


(A-6) 


(A-7) 


(A-8) 


(A-9) 


A 


Suppose  now  that  the  control  is  saturated.  Then 
Equation  (A-5)  is  no  longer  valid,  and  Pontryagin's  minimum 
principle  must  be  applied.  The  'u'  part  of  H is 
_T  _ T _ 

u (t)Ru(t)  + p (t)B(x(t)  ,t)u(t)  . 


Assuming  that  the  components  of  the  control  u are  linearly 


independent,  u.(t)  is  to  be  chosen  so  as  to  minimize  the  u^ 
part  of  the  Hamiltonian: 

T 


th 


ui (c) [riiui/t)  + P (t)bi] 


Since  r^  is  positive  and  the  magnitude  of  u is  1,  then  the 
following  choices  of  u.(t)  will  yield  the  smallest  value  of 
the  Hamiltonian. 


u.(t)  = 


-1  if  p ( t)b . > 2r . . 

T 1 ~ 11 


+1  if  p ( t)b . < -2r . . 

l—ii 


is 


In  summary  then,  the  form  of  the  optimal  controller 

T 

-1 

T 


if  p ( t)b  . > 2r  . . 

l — li 


u*(t)  = < 


P (t)bi 


2r . . 

li 


if 


+1 


if  p (t)bi  <_  -2rii 


(A-10) 


(A-ll) 


(A-12) 


-2r . . < p (t)b  . < 2r  . . 

li  l li 


(A-13) 


A-2 
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Appendix  B 

FOUR  STATE  SIMULATION  LISTING 

I 


i 


DISTRIBUTION 


4 


ARMY 


Defense  Documentation  Center 
Cameron  Station 
Alexandria,  Virginia  22314 


No.  of  Copies 
12 


Commander 

ASD 

ATTN:  DRCPH-HFF,  Col.  Dobbs 
Wright-Patterson  Air  Force  Base,  Ohio  45433 


Commander 
Picatinny  Arsenal 

ATTN:  DRCPM-CAWS-PA , Co.  R.  Phillips 
Dover,  New  Jersey  07801 


LOCAL 

DRDMI-T 

DRDMI-TG 

DRDMI-TGN 

DRCPM-HF 

DRCPM-HFE 

DRCPM-CAWS-FO 

DRDMI-V 


Dr.  Kobler 
Mr.  Huff 
Mr.  Gambill 
Dr.  Pas trick 
Col.  Fiest 
Mr.  Comer 
Mr.  Keat 
Maj . Love 


A 


1 


1 


1 

1 

1 

5 

1 

1 

1 
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